Description

[0001] The present invention relates to newly identified EDG8 receptors, polynucleotides encoding this receptor,
polypeptides encoded by such polynucleotides, and the preparation and use thereof.

[0002] In an effort to identify new G-protein coupled receptors of the EDG (endothelial differentiation gene)-family, a
novel member of the EDG-family of G-protein coupled receptors, human EDG8, was identified. The full-length clone
was isolated and studies on chromosomal mapping, tissue expression and identification as a functional cellular receptor
for sphingosine 1-phosphate were performed. Taken together, the data provide compelling evidence that EDG8 is
the fifth receptor for sphingosine 1-phosphate, exclusively expressed in peripheral tissues, its presence in endothelial
cells being responsible for the broad tissue distribution.

[0003] The lysolipid phosphate mediators lysophosphatidic acid (LPA) and sphingosine 1-phosphate (S1P) have
attracted increasing attention as modulators of a variety of important biological functions (Moolenaar et al., 1997; Morris,
1999; Lynch, 1999) and their list of biological activities is continuously growing.

Among the biological responses to LPA are platelet aggregation (Durieux and Lynch, 1993; Moolenaar, 1994; Jalink et
al., 1994; Siess et al., 1999; Gueguen et al., 1999), smooth muscle contraction, in vivo vasoactive effects, chemotaxis,
expression of adhesion molecules on endothelial cells, increased tight junction permeability, activation of membrane
ion channels and many others. The biochemical signalling events that mediate the cellular effects of LPA include
stimulation of phospholipases, mobilization of intracellular Ca2+, inhibition of adenylyl cyclase, activation of
phosphatidylinositol 3-kinase, activation of the Ras-Raf-MAP kinase cascade and stimulation of Rho-GTPases (Moolenaar et al.,
1997).

S1P, in particular, is implicated in cell proliferation, induction/suppression of apoptosis, modulation of cell motility,
angiogenesis, tumor invasiveness, platelet activation and neurite retraction. Cellular signalling by S1P involves activation
of PLCβ and subsequent intracellular Ca2+ release, activation of MAP-kinases, activation of inward rectifying K+-channels
and inhibition and/or activation of adenylyl cyclase.

Both LPA and S1P are recognized to signal cells through a set of G-protein coupled receptors (GPCRs) known as
EDG (endothelial differentiation gene)-receptors. The EDG-family of GPCRs currently comprises seven human members
(EDG1-7) that fall into two major groups depending on their preference for the activating lipid-ligand: EDG1, 3, 5
and 6 preferentially interact with S1P, EDG2, 4 and 7 preferentially interact with LPA.

The assignment of specific biological functions to certain receptor subtypes is hampered by the fact that EDG receptors 
are expressed in an overlapping fashion, they activate multiple and in part identical signal transduction pathways, the 
selectivity for their activating ligands is not absolute, and medicinal chemistry is only poorly developed in that specific 
antagonists for dissecting the pharmacology of the individual subtypes are not available yet. 
An important step to shed more light on the biological role of the individual receptor subtypes would be to identify the 
complete set of receptors that respond to the phospholipid mediators S1P and LPA. 

[0004] The present invention relates to newly identified EDG8 receptors, polynucleotides encoding this receptor,
polypeptides encoded by such polynucleotides, and the preparation and use thereof.

[0005] The present invention relates to an isolated polynucleotide comprising a nucleotide sequence that has at least 
80 % identity to a nucleotide sequence encoding the polypeptide of SEQ ID NO. 2 or the corresponding fragment 
thereof; or a nucleotide sequence complementary to said nucleotide sequence. 
[0006] Preferably, the polynucleotide is DNA or RNA. 

Preferably, the nucleotide sequence of the polynucleotide is at least 80 % identical to that contained in SEQ ID NO. 1.
In another embodiment, the polynucleotide has the nucleotide sequence SEQ ID NO. 1. In another embodiment, the
polynucleotide encodes the polypeptide of SEQ ID NO. 2 or a fragment thereof.
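
The 80 % identity threshold used above can be illustrated with a minimal sketch. The specification does not state how identity is to be computed, so the definition below (identical columns divided by aligned columns of a pre-computed pairwise alignment, with "-" as the gap symbol) is an assumption, and the function names `percent_identity` and `meets_threshold` are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences ('-' marks a gap).

    Assumed definition: identical columns / aligned columns * 100.
    Columns where both sequences carry a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy sequences for illustration only, not taken from SEQ ID NO. 1.
    print(percent_identity("ATGGAGTCGG", "ATGGAGTCGA"))
    print(meets_threshold("ATGC-", "ATGCA"))
```

Other conventions (for example dividing by the length of the shorter sequence) would give different values for gapped alignments, so the convention should be fixed before a threshold is applied.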

[0007] Another aspect of the invention relates to an expression system for the expression of EDG8. The EDG8 DNA
or RNA molecule comprises an expression system, wherein said expression system is capable of producing a
polypeptide or a fragment thereof having at least 80 % identity with a nucleotide sequence encoding the polypeptide of SEQ
ID NO. 2 or said fragment when said expression system is present in a compatible host cell.
The invention relates to a host cell comprising the expression system.

[0008] In another aspect, the invention relates to a process for producing an EDG8 polypeptide or fragment comprising
culturing a host cell comprising the expression system under conditions sufficient for the production of said polypeptide
or fragment. Preferably, said polypeptide or fragment is expressed at the surface of said cell. The invention also
relates to cells produced by this process.

The process preferably further includes recovering the polypeptide or fragment from the culture.

[0009] In another aspect, the invention relates to a process for producing a cell which produces an EDG8 polypeptide
or a fragment thereof comprising transforming or transfecting a host cell with the expression system such that the host
cell, under appropriate culture conditions, produces an EDG8 polypeptide or fragment.

[0010] In particular, the invention relates to an EDG8 polypeptide or a fragment thereof comprising an amino acid
sequence which is at least 80 % identical to the amino acid sequence contained in SEQ ID NO. 2; in particular to an
EDG8 polypeptide or a fragment thereof having amino acid sequence SEQ ID NO. 2 or a part thereof.

[0011] Further, the invention relates to a process for diagnosing a disease or a susceptibility to a disease related to
expression or activity of EDG8 polypeptide comprising:

a) determining the presence or absence of mutation in the nucleotide sequence encoding said EDG8 polypeptide
in the genome of said subject; and/or

b) analyzing for the presence or amount of the EDG8 polypeptide expression in a sample derived from said subject.

[0012] In addition, the invention relates to a method for identifying compounds which bind to EDG8 polypeptide
comprising:

a) contacting a cell comprising the expression system or a part of such a cell with a candidate compound; and 

b) assessing the ability of said candidate compound to bind to said cells. 

[0013] Preferably, the method further includes determining whether the candidate compound effects a signal
generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate compound which effects
production of said signal is identified as an agonist.

[0014] In another embodiment of the invention, the method further includes determining whether the candidate
compound effects a signal generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate
compound which effects production of said signal is identified as an antagonist.

[0015] The invention also relates to an agonist or antagonist identified by such methods. 

[0016] In another special embodiment of the method, the method further includes contacting said cell with a known
agonist for said EDG8 polypeptide; and determining whether the signal generated by said agonist is diminished in the
presence of said candidate compound, wherein a candidate compound which effects a diminution in said signal is
identified as an antagonist for said EDG8 polypeptide. The known agonist is for example S1P, LPA and/or DHS1P. The
invention also relates to an antagonist identified by the method.

[0017] The invention, in addition, relates to a method of preparing a pharmaceutical composition comprising

a) identifying a compound which is an agonist or an antagonist of EDG8,

b) preparing the compound, and

c) optionally mixing the compound with suitable additives.

[0018] The invention also relates to a pharmaceutical compound prepared by such a process. 

[0019] In the study, we report the cloning, chromosomal mapping, tissue expression and functional identification
as a receptor for S1P of a novel GPCR, EDG8, the fifth functional receptor for sphingosine 1-phosphate.

[0020] In an effort to identify new G-protein coupled receptors of the EDG-family, a database search with alignments
of the currently known 18 members of this receptor family, comprising human EDG1-7 sequences up to the putative
EDGs from Xenopus and zebrafish, was performed. A multiple alignment of these sequences was created by
CLUSTALW and used in a PSI-BLAST search to scan translated versions of human genomic DNA sequences, which were
publicly available in the different EMBL sections. For translation of DNA into protein sequences, individual protein files
within two respective STOP-codons were created and all proteins shorter than 50 amino acids were ignored. As the
majority of GPCRs is unspliced, searching for GPCRs within genomic sequences should bring about novel receptor
proteins.
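
The translation step described above (protein segments between two STOP-codons, discarding anything shorter than 50 amino acids) can be sketched as follows. This is an illustrative reconstruction and not the software actually used: scanning all six reading frames and using the standard genetic code are assumptions, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(dna: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate complete codons; unknown codons (e.g. containing N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def stop_to_stop_segments(dna: str, min_len: int = 50):
    """Return (strand, frame, protein) for every stop-delimited segment
    of at least min_len residues in the six reading frames (assumed)."""
    dna = dna.upper()
    hits = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            for segment in translate(seq[frame:]).split("*"):
                if len(segment) >= min_len:
                    hits.append((strand, frame, segment))
    return hits


if __name__ == "__main__":
    # Toy contig for illustration: one start codon, 60 alanine codons, one stop.
    toy = "ATG" + "GCT" * 60 + "TAA"
    for strand, frame, protein in stop_to_stop_segments(toy):
        print(strand, frame, len(protein))
```

The retained segments would then serve as the protein database scanned with the profile built from the multiple alignment.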

In the PSI-BLAST search, the various cDNAs and genomic contigs, respectively, for the human EDG1-7 receptors
were identified, together with an additional genomic hit with a high e-value that was not identical to any of the published
EDG-sequences. The nucleotide and amino acid sequence of the new putative GPCR are depicted in Fig.1A.
Hydropathy analysis (hydrophobicity plot not shown) suggests a seven transmembrane protein with three alternating extra-
and intracellular loops, assumed to be the heptahelix structure common to GPCRs.

To shed more light on the relationships involved in the molecular evolution of the EDG-receptor family, a GrowTree
phylogram was constructed using the neighbor joining method (GCG software) (Fig.1B). According to this phylogenetic
tree, the human EDG-family can be divided into two distinct groups: EDG1, 3, 5 and 6 belonging to one, EDG2, 4 and
7 belonging to the other group. These two groups are discriminated further by their preference for different lipid ligands:
EDG1, 3, 5, 6 are preferentially stimulated by sphingosine 1-phosphate (S1P), EDG2, 4 and 7 by lysophosphatidic acid
(LPA). The newly identified sequence was named EDG8 for reasons of consistency with the existing human EDG-family
nomenclature and exhibited highest similarity (86.8%) to the rat nrg1-protein (Fig.1B), a GPCR recently cloned
by EST-expression profiling from a rat PC12 cell library (Glickman et al., 1999). The high similarity between EDG8 and
the known sphingosine 1-phosphate (S1P) receptors EDG1, 3 and 5 (48-51%) (Fig.1C) led us to test the hypothesis
that EDG8 may be a functional S1P-receptor.






EP1 149 907 A1 



In testing for S1P receptor activity, the EDG8 cDNA was introduced into Chinese hamster ovary (CHO) cells by transient
transfection. CHO cells were chosen as they exhibit minimal responses to sphingosine 1-phosphate in concentrations
up to 1 µM but respond to S1P after transfection with the S1P preferring receptors EDG1, 3 and 5 (Okamoto et al.,
1998; Kon et al., 1999). To test functional receptor activity, it was decided to monitor the mobilization of intracellular
Ca2+ for three reasons:

1.) S1P has been reported to increase Ca2+ in many cell types;

2.) the identification of EDG1, 3, 5 and 6 as receptors for S1P has provided the molecular basis for a GPCR
mediated mechanism and the receptors are known to mediate intracellular Ca2+ through either PTX-sensitive Gi
proteins or the PTX-insensitive Gαq/11 pathway;

3.) all currently known S1P-responding EDG-receptors (except EDG6) are present in endothelial cells (A.
Niedernberg et al., submitted), in which intracellular Ca2+ release is a major pathway in the generation of NO, an important
factor in vascular biology.

Thus, identification of the complete set of S1P receptors involved in intracellular Ca2+ mobilization could help clarify
the role of the individual subtypes in endothelial cell signalling.

[0021] Fig.2 depicts measurement of the intracellular Ca2+ concentration, mediated by S1P via the different
S1P-receptors EDG1, 3, 5, 6 and the putative S1P-receptor EDG8, cotransfected in CHO cells with empty vector DNA as
a control or various G-protein α subunits. Intracellular Ca2+ concentrations were recorded as real time measurements
using the fluorometric imaging plate reader (FLIPR, Molecular Devices). As has been reported for EDG1, 3 and 5,
S1P elicited intracellular Ca2+ signals that did not require cotransfection of a G-protein α subunit. As already known
for a large number of Gq-coupled receptors, coexpression of Gαq augments the EDG1 and 5-mediated Ca2+-release
as compared with the Ca2+ signal induced by stimulation of endogenous Gαq. In case of EDG3, additional
exogenously added Gαq did not further improve the signal intensity. In case of EDG6, Yamazaki Y. et al. (2000) reported
an S1P-induced mobilization of intracellular Ca2+ but we failed to detect a significant Ca2+ increase above basal levels
in the absence of any cotransfected G-protein α subunit. The reason for this discrepancy could be the cellular
background, as Yamazaki Y. et al. (2000) reported that the Ca2+ signal can be completely abolished in the presence of
pertussis toxin (PTX), indicating the involvement of Gi-type G-proteins. In this case the Ca2+ signal would be elicited
by βγ, released from activated Gαiβγ heterotrimers. The Gαi-induced Ca2+ signals are known to be much smaller in
intensity as compared with the Ca2+ signals induced by bona-fide Gq-linked receptors (Kostenis et al., 1997a; Kostenis
et al., 1997b). It may be that detection of such Ca2+ concentrations is beyond the sensitivity of the FLIPR system.
EDG8 did not release intracellular Ca2+ when stimulated with S1P (Fig.2), but gained the ability to mobilize Ca2+ upon
cotransfection with Gα16, a G-protein α-subunit known to couple GPCRs from different functional classes to the
Gq-PLCβ pathway, or Gαqi5, a mutant G-protein α subunit that confers onto Gi-linked receptors the ability to stimulate Gq.
These results show that EDG8, as opposed to EDG3 and 5, is not a bona-fide Gq-coupled receptor but a functional
cellular receptor for S1P. To check whether the EDG8 receptor also reacts to related lysophospholipid mediators, we
examined the abilities of lysophosphatidic acid (LPA), dihydrosphingosine 1-phosphate (DHS1P),
sphingosylphosphorylcholine (SPC) and lysophosphatidylcholine (LPC) to increase intracellular Ca2+ in CHO cells transiently transfected
with the EDG8 receptor and the G-protein α subunits Gα16 and Gαqi5 (Fig.3). Besides S1P, which was the most potent
activator of EDG8, LPA and DHS1P evoked [Ca2+]i increases in concentrations of 100 and 1000 nM. SPC and LPC,
respectively, failed to generate any significant response in concentrations up to 1 µM. These data show that EDG8 is
an S1P preferring receptor, but also responds to related phospholipids like DHS1P or LPA, as has also been reported
for EDG1, which is a high affinity receptor for S1P and a low affinity receptor for LPA.

[0022] Next, the expression pattern of the human EDG8 gene in human tissues was investigated by Northern blot
analysis (Fig.4). Northern blot analysis shows EDG8 expression in several peripheral tissues. Tissues positive for
EDG8 RNA were skeletal muscle, heart and kidney; lower abundance of RNA was seen in liver and placenta; no signal
was obtained in brain, thymus, spleen, lung and peripheral blood leukocytes. In all tissues a single RNA transcript of
5.5 kb was observed after hybridization with a DIG-labelled EDG8 antisense RNA probe. EDG8 exhibits highest
similarity to the rat nrg1-GPCR with an amino acid identity of 86.8% (Fig.1B), suggesting that it may be the human homolog
of the rat nrg1 protein. However, the expression pattern of human EDG8 is quite different from the rat nrg1-receptor,
which is found almost exclusively in brain. This finding suggests that EDG8 may represent a closely related but entirely
different receptor from nrg1, rather than the human homolog. Nevertheless, it does not rule out the possibility that
EDG8 and nrg1 are homologs with entirely different, species-dependent expression patterns.
[0023] As the first member of the EDG-family of GPCRs was cloned as an immediate early gene induced during the
morphogenetic differentiation phase of angiogenesis (Hla and Maciag, 1990) and subsequently cloned from a human
umbilical vein endothelial cell library exposed to fluid shear stress as an upregulated gene, it is reasonable to assume
that EDG receptors play an important role in the regulation of endothelial function. Therefore, the presence of EDG8
transcripts in several human endothelial cell lines was analyzed. RT-PCR analysis of human umbilical vein endothelial
cells (HUVECs), human coronary artery endothelial cells (HCAECs), human microvascular endothelial cells of the lung
(HMVEC-L) and human pulmonary artery endothelial cells (HPAEC) revealed EDG8 expression in all cell lines tested
(Fig.5A). In Fig.5B it is shown that EDG8 specific primers indeed solely amplify EDG8 sequences and none of the
related EDG1-7 sequences. These findings suggest that the presence of EDG8 in different peripheral organs may be
due to its localization in endothelial cells; it does not rule out, however, that EDG8 transcripts occur in cell types other
than endothelial cells.

[0024] The expression of EDG8 in addition to EDG1, 3, and 5 (Rizza et al., 1999) in HUVECs and several other
endothelial cell lines is intriguing in view of all the reports regarding S1P effects on endothelial cell signalling. Hisano
et al. (1999) reported that S1P protects HUVECs from apoptosis induced by withdrawal of growth factors and stimulates
HUVEC DNA synthesis; the authors derived a model for cell-cell interactions between endothelial cells and platelets
but the S1P-receptor responsible for protection of HUVECs from apoptosis could not be identified. Rizza et al. (1999) reported
that S1P plays a role in endothelial cell leukocyte interaction in that S1P induces expression of cell adhesion molecules
in human aortic endothelial cells, allowing monocytes and neutrophils to attach. These effects were blocked by pertussis
toxin, suggesting the involvement of a Gi-coupled S1P receptor. The responsible S1P-receptor subtype, however,
could not be identified and the EDG8 receptor was not included at the time of this study. Expression profiling of all
EDG receptors in individual cell lines and the use of EDG receptor subtype selective compounds will clearly be
necessary to help determine the role of the individual S1P receptors in endothelial cell signalling mechanisms.
[0025] Finally, the mapping of EDG receptors in genomic sequences allowed a map position to be allocated for four genes
of this family (Tab.1). Interestingly, so far, four EDG-receptors including EDG8 are located on chromosome 19.

In conclusion, we isolated a new member of the EDG-family of G-protein coupled receptors, EDG8, and showed that
it functions as a cellular receptor for sphingosine 1-phosphate. EDG8 could exclusively be detected in peripheral tissues
like skeletal muscle, heart and kidney and in several human endothelial cell lines. It is conceivable that the expression
in endothelial cells may account for the broad tissue distribution of this receptor. The existence of at least eight
EDG-receptors for lysophospholipids suggests that receptor subtype selective agonists and antagonists will be essential
for a better understanding of the biology of lysophospholipids and their respective receptors.

Figure legends 

[0026] Fig.1A: The nucleotide and deduced amino acid sequence of human EDG8. The deduced amino acid
sequence is shown below the nucleotide sequence with the nucleotide positions indicated on the left. The putative seven
transmembrane domains are underlined.

[0027] Fig.1B: Phylogenetic tree of the EDG-family of receptors. The phylogenetic tree depicted was derived by the
neighbor joining method performed with the GCG program.

[0028] Fig.1C: Alignment of the amino acid sequence of human EDG8 with the other EDG-family members. The
amino acid sequence of EDG8 is compared with the EDG1-7 polypeptides (EDG1: accession number M31210, EDG2:
accession number U80811, EDG3: accession number X83864, EDG4: accession number AF011466, EDG5: accession
number AF034780, EDG7: accession number AF127138). The approximate boundaries of the seven putative
transmembrane domains are boxed. Gaps are introduced to optimize the alignment.

[0029] Fig.2: Mobilization of intracellular Ca2+ by S1P mediated by the EDG1, 3, 5 and 8 receptor in CHO cells,
cotransfected with empty vector DNA as a control or the indicated G-protein α subunits. Agonist-mediated changes of
intracellular Ca2+ were measured with the FLIPR using the Ca2+-sensitive dye Fluo4. Fluorescence of transfected
cells loaded with Fluo4 was recorded before and after addition of S1P, applied in the indicated concentrations. Data
are expressed as means of quadruplicate determinations in a single experiment. An additional experiment gave similar
results.

[0030] Fig.3: Effects of S1P, LPA and related lysophospholipid mediators on EDG8-mediated increase in intracellular
Ca2+ measured with the FLIPR as described in the legend of Fig.2. The different lipids were applied in concentrations
of 10, 100 and 1000 nM, respectively. Data are means of quadruplicate determinations of a representative experiment.
Two additional experiments gave similar results.

[0031] Fig.4: Northern blot analysis of EDG8 in human tissues. Poly(A)+ RNA (1 µg) from various human tissues
(human multiple tissue Northern blots, CLONTECH) was hybridized with probes specific to human EDG8 (upper panel)
and glyceraldehyde-3-phosphate dehydrogenase, GAPDH, (lower panel) on a nylon membrane. The origin of each RNA
is indicated at the top, the molecular mass of standard markers in kilobases (kb) is shown on the left.
[0032] Fig.5A: Reverse transcriptase-polymerase chain reaction (RT-PCR) analysis of EDG8 in different human
endothelial cell lines (HUVECS: human umbilical vein endothelial cells; HCAEC: human coronary artery endothelial cells;
HMVEC-L: human microvascular endothelial cells from lung; HPAEC: human pulmonary artery endothelial cells).
EDG8-specific transcripts were detected in all endothelial cell lines. Agarose gel electrophoresis of the PCR products
after 35 cycles of amplification is shown. Amplification with EDG8-specific primers yields a 522 bp EDG8-fragment as
indicated by the arrow.

[0033] Fig.5B: PCR analysis of EDG8 primers for specificity of amplification of EDG8 sequences. Primers, specific
for the EDG8 sequence, were checked for potential amplification of the related EDG1-7 sequences. The EDG8 specific
522 bp band occurred only when EDG8 was used as a template.



Table 1: Chromosomal localization of EDG-receptors 1-8.

Mapping of EDG receptors to genomic sequences allowed a chromosomal assignment to be derived for EDG4, 5, 6 and
8. The chromosomal localization of EDG1-3 was obtained from the GeneCards data collection.

EDG     Chromosomal localisation    according to BAC AccNr.
EDG1    1p21.1-21.3                 AL161741
EDG2    9q31.1-32//18p11.3          AL157881//AP000882
EDG3    9q22.1-q22.2
EDG4    19p12                       NT_000939
EDG5    19                          AC011511
EDG6    19p13.3                     AC011547
EDG7    1p22.3-31.2                 AL139822
EDG8    19                          AC011461
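
As a small worked illustration of the statement that four EDG-receptors map to chromosome 19, the localisations of Table 1 can be grouped by chromosome. The dictionary below simply transcribes the table; the parsing rule (leading chromosome number, "//" separating alternative loci) and the function names are assumptions made for this sketch.

```python
import re
from collections import defaultdict

# Localisations transcribed from Table 1.
EDG_LOCI = {
    "EDG1": "1p21.1-21.3",
    "EDG2": "9q31.1-32//18p11.3",
    "EDG3": "9q22.1-q22.2",
    "EDG4": "19p12",
    "EDG5": "19",
    "EDG6": "19p13.3",
    "EDG7": "1p22.3-31.2",
    "EDG8": "19",
}


def chromosomes(locus: str) -> list:
    """Extract the chromosome of each '//'-separated locus (assumed format)."""
    found = []
    for part in locus.split("//"):
        match = re.match(r"(\d+|[XY])", part.strip())
        if match:
            found.append(match.group(1))
    return found


def receptors_by_chromosome(loci: dict) -> dict:
    """Group receptor names by the chromosome(s) they are mapped to."""
    grouped = defaultdict(list)
    for receptor, locus in loci.items():
        for chrom in chromosomes(locus):
            grouped[chrom].append(receptor)
    return dict(grouped)


if __name__ == "__main__":
    for chrom, receptors in sorted(receptors_by_chromosome(EDG_LOCI).items(),
                                   key=lambda item: int(item[0])):
        print(chrom, receptors)
```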



Examples 



Example 1: Molecular cloning of the human EDG8 receptor. 

[0034] As the putative human EDG8 sequence is intronless, we cloned the receptor from human genomic DNA
(CLONTECH, Palo Alto, CA, 94303-4230) via polymerase chain reaction (PCR). PCR conditions established to amplify
the EDG8 sequence were 94°C, 1 min followed by 35 cycles of 94°C, 30 sec, 68°C, 3 min, using the GC-Melt Kit
(CLONTECH, Palo Alto, CA). Primers designed to amplify the EDG8 sequence contained a HindIII site in the forward and an
EcoRI site in the reverse primer, respectively. The 1197 bp PCR product was cloned into the pcDNA3.1(+) mammalian
expression vector (Invitrogen, Carlsbad, California) and sequenced in both directions.

Example 2: Cell culture and Transfection. 


[0035] CHO-K1 cells were grown in basal ISCOVE medium supplemented with 10% fetal bovine serum at 37°C in
a humidified 5% CO2 incubator. For transfections, 2 x 10^5 cells were seeded into 35-mm dishes. About 24 hr later cells
were transiently transfected at 50-80% confluency with the indicated receptor and G-protein constructs (1 µg of plasmid
DNA each) using the Lipofectamine transfection reagent and the supplied protocol (GIBCO). 18-24 hr after transfection
cells were seeded into 96-well plates at a density of 50,000 cells per well and cultured for 18-24 additional hr until used
in the functional FLIPR assays.

The cDNA for Gα16 was cloned from TF1 cells by RT-PCR and ligated into the pcDNA1.1 mammalian expression
vector (Invitrogen). Murine wild type Gαq was cloned from cells by RT-PCR and inserted into the BamHI-NsiI sites of
pcDNA1.1. To create the C-terminally modified Gαqi5 subunit, in which the last five aa of wt Gαq were replaced with
the corresponding Gαi sequence, a 175-bp BglII-NsiI fragment was replaced, in a two piece ligation, with a synthetic
DNA fragment containing the desired codon changes. The correctness of all PCR-derived sequences was verified by
sequencing in both directions.

Example 3: Fluorometric Imaging Plate Reader (FLIPR) Assay. 


[0036] Twenty-four hours after transfection, cells were split into 96-well, black-wall microplates (Corning) at a
density of 50,000 cells per well. 18-24 hr later, cells were loaded with 95 µl of HBSS containing 20 mM HEPES, 2.5 mM
probenecid, 4 µM fluorescent calcium indicator dye Fluo4 (Molecular Probes) and 1 % fetal bovine serum for 1 h (37°C,
5% CO2). Cells were washed three times with HBSS containing 20 mM HEPES and 2.5 mM probenecid in a cell washer.
After the final wash, the solution was aspirated to a residual volume of 100 µl per well. Lipid ligands were dissolved
in DMSO as 2 mM stock solutions (treated with ultrasound when necessary) and diluted at least 1:100 into HBSS
containing 20 mM HEPES, 2.5 mM probenecid and 0.4 mg/ml fatty acid free bovine serum albumin. Lipids were
aliquoted as 2X solutions into a 96-well plate prior to the assay. The fluorometric imaging plate reader (FLIPR, Molecular
Devices) was programmed to transfer 100 µl from each well of the ligand microplate to each well of the cell plate and
to record fluorescence during 3 min in 1 second intervals during the first minute and 3 second intervals during the last
two minutes. Total fluorescence counts from the 18-s to 37-s time points are used to determine agonist activity. The
instrument software normalizes the fluorescent reading to give equivalent initial readings at time zero.
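
The read-out described above can be sketched in a few lines. This is an illustration only and not the instrument software: the exact read times (0 to 59 s in 1-s steps, then 60 to 180 s in 3-s steps), the inclusive 18-s to 37-s window and all function names are assumptions made for this example.

```python
def read_times(total_s=180, fast_until_s=60, fast_step_s=1, slow_step_s=3):
    """Assumed read schedule: 1-s intervals in the first minute,
    3-s intervals for the remaining two minutes."""
    times = list(range(0, fast_until_s, fast_step_s))
    times += list(range(fast_until_s, total_s + 1, slow_step_s))
    return times


def agonist_response(times, counts, start_s=18, end_s=37):
    """Sum of fluorescence counts for reads taken from start_s to end_s (inclusive)."""
    if len(times) != len(counts):
        raise ValueError("times and counts must have the same length")
    return sum(c for t, c in zip(times, counts) if start_s <= t <= end_s)


def final_concentration(ligand_conc, transfer_ul=100.0, residual_ul=100.0):
    """Concentration in the cell well after adding the ligand solution."""
    return ligand_conc * transfer_ul / (transfer_ul + residual_ul)


if __name__ == "__main__":
    times = read_times()
    flat_trace = [1] * len(times)          # toy trace, one count per read
    print(len(times))                      # number of reads in 3 min
    print(agonist_response(times, flat_trace))
    print(final_concentration(2000.0))     # 2X solution of 2000 nM
```

With equal transfer and residual volumes the ligand is diluted twofold in the well, which is why the lipids are prepared as 2X solutions.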

Example 4: Northern Blot analysis. 

[0037] Human multiple tissue Northern blots were purchased from CLONTECH (Palo Alto, CA, 94303-4230, USA).
Antisense RNA probes were generated by subcloning nucleotides 279-1197 of the coding region into the BamHI-EcoRI
sites of the expression vector pSPT18 (Roche Diagnostics, Mannheim, Germany) and subsequent random priming
with a DIG-RNA Labeling kit (Roche Diagnostics, Mannheim, Germany), using T7 RNA polymerase. Hybridization was
carried out at 68°C for 16 h in hybridization buffer (Dig Easy Hyb, Roche Diagnostics, Mannheim, Germany). Each blot
was washed, blocked and detected as indicated in the standard protocol with the DIG Wash and Block Buffer set
(Roche Diagnostics, Mannheim, Germany) and treated with 1 ml CSPD ready-to-use (Roche Diagnostics, Mannheim,
Germany) for 15 min at 37°C and developed for 5 min on the Lumiimager (Roche). Finally, each blot was stripped (50
% formamide, 5 % SDS, 50 mM Tris/HCl pH 7.5; 80°C, 2 x 1 hour) and rehybridized with a GAPDH antisense RNA probe
as an internal standard.

Example 5: RNA Extraction and RT-PCR.

[0038] RNA was prepared from different endothelial cell lines (HUVECS, HCAEC, HMVEC-L, HPAEC) using the
TRIzol reagent (manufacturer, location). Briefly, for each endothelial cell line, cells of a subconfluent 25 cm2 tissue culture
flask were collected in 2.5 ml TRIzol and total RNAs were extracted according to the supplied protocol. The purity of
the RNA preparation was checked by verifying the absence of genomic DNA. An aliquot of RNA, corresponding to
~5 µg, was used for the cDNA generation using MMLV reverse transcriptase and the RT-PCR kit from STRATAGENE.
RT-PCR was carried out in a volume of 50 µl; the RT-PCR conditions were set to 65°C for 5 min, 15 min at RT, 1 hour
at 37°C, 5 min at 90°C, chill on ice.

The cDNA templates for the PCR reactions (35 cycles of 94°C for 30 sec, 68°C for 3 min) were the reverse transcribed
products of RNAs isolated from human endothelial cell lines (HUVECS, HCAEC, HMVEC-L, HPAEC). Typically, 1-5 µl
of reverse transcribed cDNAs were used as templates for the PCR reactions.

Example 6: Sources of materials. 

[0039] 1-oleoyl-LPA, sphingosine 1-phosphate (S1P), dihydrosphingosine 1-phosphate (DHS1P),
lysophosphatidylcholine (LPC), sphingosylphosphorylcholine (SPC) and fatty acid free BSA were from SIGMA (P.O.Box 14508, St.
Louis, Missouri 63178). CHO-K1 cells were obtained from the American Type Culture Collection (ATCC, Manassas,
Virginia), cell culture media and sera from GIBCO BRL (Gaithersburg, MD), the Ca fluorescent dye Fluo4 and pluronic
acid from Molecular Devices (Sunnyvale CA 94089-1136, USA), human northern blot membrane from CLONTECH (1020
East Meadow Circle, Palo Alto, California 94303-4230, USA), commercially available cDNAs (heart, fetal heart, left
atrium, left ventricle, kidney, brain, liver, lung, aorta) from Invitrogen, oligonucleotides from MWG-Biotech AG
(Ebersberg, Germany), the RT-PCR kit from SIGMA, the GC-melt PCR kit from Clontech (Palo Alto, CA), the expression
plasmid pcDNA3.1 for EDG8 and pcDNA1.1 for expression of G-protein α subunits from Invitrogen (Carlsbad, CA
92008), competent DH5α from GIBCO and MC 1063 from Invitrogen.

List of non-standard abbreviations:

[0041] S1P, sphingosine 1-phosphate; LPA, lysophosphatidic acid; dHS1P, dihydrosphingosine 1-phosphate; SPC,
sphingosylphosphorylcholine; LPC, lysophosphatidylcholine; GPCR, G-protein-coupled receptor; G-protein, guanine
nucleotide-binding protein; [Ca2+]i, intracellular calcium concentration; RT-PCR, reverse transcription polymerase
chain reaction; bp, base pair; ORF, open reading frame; EST, expressed sequence tag; FAF-BSA, fatty acid free bovine
serum albumin; HUVECS, human umbilical vein endothelial cells; HCAEC, human coronary artery endothelial cells;
HMVEC-L, human microvascular endothelial cells from lung; HPAEC, human pulmonary artery endothelial cells.

Table 2: 

SEQ ID NO. 1 : Nucleotide sequence of human EDG8 

1 ATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAAC 
61 TACACCGGCAAGCTCCGCGGTGCGCGCTACCAGCCGGGTGCCGGCCTGCGCGCCGACGCC 
121 GTGGTGTGCCTGGCGGTGTGCGCCTTCATCGTGCTAGAGAATCTAGCCGTGTTGTTGGTG 
181 CTCGGACGCCACCCGCGCTTCCACGCTCCCATGTTCCTGCTCCTGGGCAGCCTCACGTTG 
241 TCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACATCCTACTGTCGGGGCCGCTCACG 
301 CTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCGTGGCACTCACT 
361 GCGTCCGTGCTGAGCCTCCTGGCCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGG 
421 GGGCCCGCGCCCGTCTCCAGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGGGGC 
481 GTGTCGCTGCTCCTCGGGCTCCTGCCAGCGCTGGGCTGGAATTGCCTGGGTCGCCTGGAC 
541 GCTTGCTCCACTGTCTTGCCGCTCTACGCCAAGGCCTACGTGCTCTTCTGCGTGCTCGCC 

601 TTCGTGGGCATCCTGGCCGCTATCTGTGCACTCTACGCGCGCATCTACTGCCAGGTACGC 
661 GCCAACGCGCGGCGCCTGCCGGCACGGCCCGGGACTGCGGGGACCACCTCGACCCGGGCG
721 CGTCGCAAGCCGCGCTCGCTGGCCTTGCTGCGCACGCTCAGCGTGGTGCTCCTGGCCTTT 
781 GTGGCATGTTGGGGCCCCCTCTTCCTGCTGCTGTTGCTCGACGTGGCGTGCCCGGCGCGC
841 ACCTGTCCTGTACTCCTGCAGGCCGATCCCTTCCTGGGACTGGCCATGGCCAACTCACTT 
901 CTGAACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGCTCCTGCGCCTG
961 GTCTGCTGCGGACGCCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGC 
1021 GCGGCTGAGGCTTCCGGGGGCCTGCGCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTC 
1081 AGCGGCTCGGAGCGCTCATCGCCCCAGCGCGACGGGCTGGACACCAGCGGCTCCACAGGC 
1141 AGCCCCGGTGCACCCACAGCCGCCCGGACTCTGGTATCAGAACCGGCTGCAGACTGA 
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Table 3: 

SEQ ID NO. 2: Amino acid sequence of human EDG8 

MESGLLRPAPVSEVIVLHYN
YTGKLRGARYQPGAGLRADA
VVCLAVCAFIVLENLAVLLV
LGRHPRFHAPMFLLLGSLTL
SDLLAGAAYAANILLSGPLT
LKLSPALWFAREGGVFVALT
ASVLSLLAIALERSLTMARR
GPAPVSSRGRTLAMAAAAWG
VSLLLGLLPALGWNCLGRLD
ACSTVLPLYAKAYVLFCVLA
FVGILAAICALYARIYCQVR
ANARRLPARPGTAGTTSTRA
RRKPRSLALLRTLSVVLLAF
VACWGPLFLLLLLDVACPAR
TCPVLLQADPFLGLAMANSL
LNPIIYTLTNRDLRHALLRL
VCCGRHSCGRDPSGSQQSAS
AAEASGGLRRCLPPGLDGSF
SGSERSSPQRDGLDTSGSTG
SPGAPTAARTLVSEPAAD*

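The relationship between SEQ ID NO. 1 and SEQ ID NO. 2 can be checked by translating the open reading frame with the standard genetic code. The following Python sketch is an illustration only and is not part of the original disclosure; the function name `translate` and the choice of test fragments (the first 60 and the last 18 nucleotides of SEQ ID NO. 1) are assumptions made for the example. A reading frame of 1197 nucleotides corresponds to 399 codons, that is 398 amino acids followed by one stop codon.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code; codons are enumerated in TCAG order
# (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(orf: str) -> str:
    """Translate a DNA reading frame into one-letter amino acid codes.

    Stop codons are rendered as '*'. The input length must be a
    multiple of three.
    """
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("reading frame length must be a multiple of 3")
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))


# First 60 nucleotides of SEQ ID NO. 1
start = "ATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAAC"
# Last 18 nucleotides of SEQ ID NO. 1, including the stop codon
end = "GAACCGGCTGCAGACTGA"

print(translate(start))  # MESGLLRPAPVSEVIVLHYN
print(translate(end))    # EPAAD*
```

Applied to the complete sequence, the same function would be expected to return the 398 residues of SEQ ID NO. 2 followed by `*`.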
Annex to the application documents - subsequently filed sequence listing
[0042] 

SEQUENCE LISTING 

<110> Aventis Pharma Deutschland GmbH 

<120> EDG8 receptor, its preparation and use 

<130> D-2000/A024 

<140> 00108858.2 
<141> 2000-04-26 

<160> 2 

<170> PatentIn Ver. 2.1

<210> 1 

<211> 1197 

<212> DNA 

<213> Homo sapiens 

<400> 1 

atggagtcgg ggctgctgcg gccggcgccg gtgagcgagg tcatcgtcct gcattacaac 60 
tacaccggca agctccgcgg tgcgcgctac cagccgggtg ccggcctgcg cgccgacgcc 120 
gtggtgtgcc tggcggtgtg cgccttcatc gtgctagaga atctagccgt gttgttggtg 180 
ctcggacgcc acccgcgctt ccacgctccc atgttcctgc tcctgggcag cctcacgttg 240 
tcggatctgc tggcaggcgc cgcctacgcc gccaacatcc tactgtcggg gccgctcacg 300 
ctgaaactgt cccccgcgct ctggttcgca cgggagggag gcgtcttcgt ggcactcact 360 
gcgtccgtgc tgagcctcct ggccatcgcg ctggagcgca gcctcaccat ggcgcgcagg 420 
gggcccgcgc ccgtctccag tcgggggcgc acgctggcga tggcagccgc ggcctggggc 480 
gtgtcgctgc tcctcgggct cctgccagcg ctgggctgga attgcctggg tcgcctggac 540 
gcttgctcca ctgtcttgcc gctctacgcc aaggcctacg tgctcttctg cgtgctcgcc 600 
ttcgtgggca tcctggccgc tatctgtgca ctctacgcgc gcatctactg ccaggtacgc 660 
gccaacgcgc ggcgcctgcc ggcacggccc gggactgcgg ggaccacctc gacccgggcg 720 
cgtcgcaagc cgcgctcgct ggccttgctg cgcacgctca gcgtggtgct cctggccttt 780 
gtggcatgtt ggggccccct cttcctgctg ctgttgctcg acgtggcgtg cccggcgcgc 840
acctgtcctg tactcctgca ggccgatccc ttcctgggac tggccatggc caactcactt 900
ctgaacccca tcatctacac gctcaccaac cgcgacctgc gccacgcgct cctgcgcctg 960 
gtctgctgcg gacgccactc ctgcggcaga gacccgagtg gctcccagca gtcggcgagc 1020 
gcggctgagg cttccggggg cctgcgccgc tgcctgcccc cgggccttga tgggagcttc 1080 
agcggctcgg agcgctcatc gccccagcgc gacgggctgg acaccagcgg ctccacaggc 1140 
agccccggtg cacccacagc cgcccggact ctggtatcag aaccggctgc agactga 1197 

<210> 2 
<211> 398 
<212> PRT 

<213> Homo sapiens 
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<400> 2 

Met Glu Ser Gly Leu Leu Arg Pro Ala Pro Val Ser Glu Val Ile Val
1 5 10 15

Leu His Tyr Asn Tyr Thr Gly Lys Leu Arg Gly Ala Arg Tyr Gln Pro
20 25 30

Gly Ala Gly Leu Arg Ala Asp Ala Val Val Cys Leu Ala Val Cys Ala
35 40 45

Phe Ile Val Leu Glu Asn Leu Ala Val Leu Leu Val Leu Gly Arg His
50 55 60

Pro Arg Phe His Ala Pro Met Phe Leu Leu Leu Gly Ser Leu Thr Leu
65 70 75 80

Ser Asp Leu Leu Ala Gly Ala Ala Tyr Ala Ala Asn Ile Leu Leu Ser
85 90 95

Gly Pro Leu Thr Leu Lys Leu Ser Pro Ala Leu Trp Phe Ala Arg Glu
100 105 110

Gly Gly Val Phe Val Ala Leu Thr Ala Ser Val Leu Ser Leu Leu Ala
115 120 125

Ile Ala Leu Glu Arg Ser Leu Thr Met Ala Arg Arg Gly Pro Ala Pro
130 135 140

Val Ser Ser Arg Gly Arg Thr Leu Ala Met Ala Ala Ala Ala Trp Gly
145 150 155 160

Val Ser Leu Leu Leu Gly Leu Leu Pro Ala Leu Gly Trp Asn Cys Leu
165 170 175

Gly Arg Leu Asp Ala Cys Ser Thr Val Leu Pro Leu Tyr Ala Lys Ala
180 185 190

Tyr Val Leu Phe Cys Val Leu Ala Phe Val Gly Ile Leu Ala Ala Ile
195 200 205

Cys Ala Leu Tyr Ala Arg Ile Tyr Cys Gln Val Arg Ala Asn Ala Arg
210 215 220

Arg Leu Pro Ala Arg Pro Gly Thr Ala Gly Thr Thr Ser Thr Arg Ala
225 230 235 240

Arg Arg Lys Pro Arg Ser Leu Ala Leu Leu Arg Thr Leu Ser Val Val
245 250 255
245 250 255 
Leu Leu Ala Phe Val Ala Cys Trp Gly Pro Leu Phe Leu Leu Leu Leu
260 265 270

Leu Asp Val Ala Cys Pro Ala Arg Thr Cys Pro Val Leu Leu Gln Ala
275 280 285

Asp Pro Phe Leu Gly Leu Ala Met Ala Asn Ser Leu Leu Asn Pro Ile
290 295 300

Ile Tyr Thr Leu Thr Asn Arg Asp Leu Arg His Ala Leu Leu Arg Leu
305 310 315 320

Val Cys Cys Gly Arg His Ser Cys Gly Arg Asp Pro Ser Gly Ser Gln
325 330 335

Gln Ser Ala Ser Ala Ala Glu Ala Ser Gly Gly Leu Arg Arg Cys Leu
340 345 350

Pro Pro Gly Leu Asp Gly Ser Phe Ser Gly Ser Glu Arg Ser Ser Pro
355 360 365

Gln Arg Asp Gly Leu Asp Thr Ser Gly Ser Thr Gly Ser Pro Gly Ala
370 375 380

Pro Thr Ala Ala Arg Thr Leu Val Ser Glu Pro Ala Ala Asp
385 390 395
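
The sequence listing gives the polypeptide in three-letter code, whereas Table 3 uses one-letter code. The two notations can be compared with a simple lookup, sketched below in Python. This is an illustration only and not part of the original disclosure; the names `THREE_TO_ONE` and `to_one_letter` are assumptions made for the example, and only the 20 standard residues are covered.

```python
# Three-letter to one-letter codes for the 20 standard amino acids.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert whitespace-separated three-letter codes to one-letter codes.

    An unknown code raises KeyError, which makes transcription errors
    in a listing easy to spot.
    """
    return "".join(THREE_TO_ONE[code] for code in three_letter.split())


# Residues 1 to 16 of SEQ ID NO. 2
first_row = "Met Glu Ser Gly Leu Leu Arg Pro Ala Pro Val Ser Glu Val Ile Val"
print(to_one_letter(first_row))  # MESGLLRPAPVSEVIV
```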



Claims 

1. An isolated polynucleotide comprising a nucleotide sequence that has at least 80 % identity to a nucleotide
sequence encoding the polypeptide of SEQ ID NO. 2 or the corresponding fragment thereof; or a nucleotide sequence
complementary to said nucleotide sequence.

2. The polynucleotide of claim 1 which is DNA or RNA.

3. The polynucleotide of claim 1 wherein said nucleotide sequence is at least 80 % identical to that contained in SEQ
ID NO. 1.

4. The polynucleotide of claim 3 wherein said nucleotide sequence is contained in SEQ ID NO. 1.

5. The polynucleotide with sequence SEQ ID NO. 1.

6. The polynucleotide of claim 1 wherein said encoding nucleotide sequence encodes the polypeptide of SEQ ID
NO. 2 or a fragment thereof.

7. EDG8 DNA or RNA molecule comprising an expression system wherein said expression system is capable of
producing a polypeptide or a fragment thereof having at least 80 % identity with a nucleotide sequence encoding
the polypeptide of SEQ ID NO. 2 or said fragment when said expression system is present in a compatible host cell.

8. A host cell comprising the expression system of claim 7.

9. A process for producing an EDG8 polypeptide or fragment comprising culturing a host cell of claim 8 under conditions
sufficient for the production of said polypeptide or fragment.

10. The process of claim 9 wherein said polypeptide or fragment is expressed at the surface of said cell.

11. Cells produced by the process of claim 10.

12. The process of claim 9 which further includes recovering the polypeptide or fragment from the culture.

13. A process for producing a cell which produces an EDG8 polypeptide or a fragment thereof comprising transforming
or transfecting a host cell with the expression system of claim 7 such that the host cell, under appropriate culture
conditions, produces an EDG8 polypeptide or fragment.

14. EDG8 polypeptide or a fragment thereof comprising an amino acid sequence which is at least 80 % identical to
the amino acid sequence contained in SEQ ID NO. 2.

15. Polypeptide of claim 14 which comprises the amino acid sequence of SEQ ID NO. 2, or a fragment thereof.

16. EDG8 polypeptide or fragment prepared by the method of claim 12.

17. A process for diagnosing a disease or a susceptibility to a disease related to expression or activity of EDG8
polypeptide comprising:

c) determining the presence or absence of mutation in the nucleotide sequence encoding said EDG8 polypeptide
in the genome of said subject; and/or

d) analyzing for the presence or amount of the EDG8 polypeptide expression in a sample derived from said
subject.

18. A method for identifying compounds which bind to EDG8 polypeptide comprising:

c) contacting a cell as claimed in claim 11 or a part thereof with a candidate compound; and

d) assessing the ability of said candidate compound to bind to said cells.

19. The method of claim 18 which further includes determining whether the candidate compound effects a signal
generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate compound which
effects production of said signal is identified as an agonist.

20. The method of claim 18 which further includes determining whether the candidate compound effects a signal
generated by activation of the EDG8 polypeptide at the surface of the cell, wherein a candidate compound which
effects production of said signal is identified as an antagonist.

21. An agonist identified by the method of claim 19.

22. An antagonist identified by the method of claim 20.

23. The method of claim 18 which further includes contacting said cell with a known agonist for said EDG8 polypeptide;
and determining whether the signal generated by said agonist is diminished in the presence of said candidate
compound, wherein a candidate compound which effects a diminution in said signal is identified as an antagonist
for said EDG8 polypeptide.

24. A method as claimed in claim 23, wherein the known agonist is S1P, LPA and/or DHS1P.

25. An antagonist identified by the method of claim 23 or 24.

26. Method of preparing a pharmaceutical composition comprising

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a) identifying a compound which is an agonist or an antagonist of EDG8, 

b) preparing the compound, and 

c) optionally mixing the compound with suitable additives. 
27. Pharmaceutical compound prepared by a process of claim 26.
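
The claims recite sequences with "at least 80 % identity" without stating in this section how identity is calculated. One common convention, assumed here purely for illustration, is to count identical residues over the columns of an existing alignment in which neither sequence has a gap. The Python sketch below follows that assumption; the function name `percent_identity` and the choice of gap characters are hypothetical, and the sketch is not presented as the applicant's method. The example input is the ten-residue window at alignment positions 121 to 130 of FIG 1C for edg8_human and edg5_human.

```python
def percent_identity(a: str, b: str, gaps: str = ".-") -> float:
    """Percent identity over ungapped columns of two pre-aligned sequences.

    Assumption for this sketch: columns in which either sequence carries
    a gap character are excluded from both numerator and denominator.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in gaps or y in gaps:
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


edg8_window = "PALWFAREGG"  # edg8_human, alignment columns 121-130
edg5_window = "PVQWFAREGS"  # edg5_human, alignment columns 121-130
print(percent_identity(edg8_window, edg5_window))  # 70.0
```

A ten-residue window is far too short to characterise two receptors; a meaningful figure would require the full-length alignment, and other definitions of the denominator (for example, the length of the shorter sequence) give different values.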






EP1 149 907 A1 



FIG1A:

1 ATGGAGTCGGGGCTGCTGCGGCCGGCGCCGGTGAGCGAGGTCATCGTCCTGCATTACAAC 
MESGLLRPAPVSEVIVLHYN 

61 TACACCGGCAAGCTCCGCGGTGCGCGCTACCAGCCGGGTGCCGGCCTGCGCGCCGACGCC
YTGKLRGARYQPGAGLRADA 

121 GTGGTGTGCCTGGCGGTGTGCGCCTTCATCGTGCTAGAGAATCTAGCCGTGTTGTTGGTG 
VVCLAVCAFIVLENLAVLLV

181 CTCGGACGCCACCCGCGCTTCCACGCTCCCATGTTCCTGCTCCTGGGCAGCCTCACGTTG 
LGRHPRFHAPMFLLLGSLTL 

241 TCGGATCTGCTGGCAGGCGCCGCCTACGCCGCCAACATCCTACTGTCGGGGCCGCTCACG 
SDLLAGAAYAANILLSGPLT

301 CTGAAACTGTCCCCCGCGCTCTGGTTCGCACGGGAGGGAGGCGTCTTCGTGGCACTCACT 
LKLSPALWFAREGGVFVALT 

361 GCGTCCGTGCTGAGCCTCCTGGCCATCGCGCTGGAGCGCAGCCTCACCATGGCGCGCAGG
ASVLSLLAIALERSLTMARR 

421 GGGCCCGCGCCCGTCTCCAGTCGGGGGCGCACGCTGGCGATGGCAGCCGCGGCCTGGGGC
GPAPVSSRGRTLAMAAAAWG 

481 GTGTCGCTGCTCCTCGGGCTCCTGCCAGCGCTGGGCTGGAATTGCCTGGGTCGCCTGGAC
VSLLLGLLPALGWNCLGRLD 

541 GCTTGCTCCACTGTCTTGCCGCTCTACGCCAAGGCCTACGTGCTCTTCTGCGTGCTCGCC 
ACSTVLPLYAKAYVLFCVLA

601 TTCGTGGGCATCCTGGCCGCTATCTGTGCACTCTACGCGCGCATCTACTGCCAGGTACGC 
FVGILAAICALYARIYCQVR

661 GCCAACGCGCGGCGCCTGCCGGCACGGCCCGGGACTGCGGGGACCACCTCGACCCGGGCG 
ANARRLPARPGTAGTTSTRA 

721 CGTCGCAAGCCGCGCTCGCTGGCCTTGCTGCGCACGCTCAGCGTGGTGCTCCTGGCCTTT
RRKPRSLALLRTLSVVLLAF 

781 GTGGCATGTTGGGGCCCCCTCTTCCTGCTGCTGTTGCTCGACGTGGCGTGCCCGGCGCGC 
VACWGPLFLLLLLDVACPAR 

841 ACCTGTCCTGTACTCCTGCAGGCCGATCCCTTCCTGGGACTGGCCATGGCCAACTCACTT 
TCPVLLQADPFLGLAMANSL

901 CTGAACCCCATCATCTACACGCTCACCAACCGCGACCTGCGCCACGCGCTCCTGCGCCTG 
LNPIIYTLTNRDLRHALLRL 

961 GTCTGCTGCGGACGCCACTCCTGCGGCAGAGACCCGAGTGGCTCCCAGCAGTCGGCGAGC 
VCCGRHSCGRDPSGSQQSAS

1021 GCGGCTGAGGCTTCCGGGGGCCTGCGCCGCTGCCTGCCCCCGGGCCTTGATGGGAGCTTC 
AAEASGGLRRCLPPGLDGSF

1081 AGCGGCTCGGAGCGCTCATCGCCCCAGCGCGACGGGCTGGACACCAGCGGCTCCACAGGC 
SGSERSSPQRDGLDTSGSTG

1141 AGCCCCGGTGCACCCACAGCCGCCCGGACTCTGGTATCAGAACCGGCTGCAGACTGA 
SPGAPTAARTLVSEPAAD*
[Figure: dendrogram of EDG-family receptor sequences; the branch structure is not recoverable from the extracted text. Leaf labels, top to bottom: edg2_sheep, edg2_bovin, edg2_mouse, edg_xenopus, edg7_human, edg4_human, edg6_mouse, edg6_human, nrg1_rat, edg8_human, edg5_rat, edg5_human, edg1_rat, edg1_mouse, edg1_human, edg_zebrafish, edg3_human, edg3_fugu, edg2_human]
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FIG1C: 



1 60
edg2_human MAAISTSIPV ISQPQFTAMN EPQCFYNESI AFFYNRSGKH LAT.EWNTVS KLVMGL..GI
edg7_human MN E..CHYDKHM DFFYNRSNTD TVD.DW.TGT KLVIVLCVGT
edg4_human MVI MGQCYYNETI GFFYNNSGKE LSS.HWR..P KDVWVALGL
edg1_human MGPTS VPLVKAHRSS VSDYVNYDII VRHYNYTGKL ..NISADKEN SIKLTSWFI
edg3_human MATALPPR LQPVRGNETL REHYQYVGKL AGRLKEASEG S.TLTTVLFL
edg5_human MGSL YSSYLNPNKV QEHYNYTKE. ..TLETQETT SRQVASAFIV
edg8_human MESGL LRPAPVSEVI VLHYNYTGKL RG.ARYQPGA GLRADAVVCL
edg6_human MNATG TPVAPESCQQ LAAGGHSRLI VLHYNHSGRL AGR.GGPEDG GLGALRGLSV



61 120
edg2_human TVCIFIMLAN LLVMVAIYVN RRFHFPIYYL KANLAAADFF AGLAYFYLHF NTGPNTRRLT
edg7_human FFCLFIFFSN SLVIAAVIKN RKFHFPFYYL LANLAAADFF AGIAYVFLMF NTGPVSKTLT
edg4_human TVSVLVLLTN LLVIAAIAStf RRFHQPIYYL LGNLAAADLF AGVAYLFLMF HTGPRTARLS
edg1_human LICCFIILEN IFVLLTIWKT KKFHRPMYYF IGNLALSDLL AGVAYTANLL LSGATTYKLT
edg3_human VICSFIVLEN LMVLIAIWKN NKFHNRMYFF IGNLALCDLL AGIAYKVNIL MSGKKTFSLS
edg5_human ILCCAIVVEN LLVLIAVARN SKFHSAMYLF LGNLAASDLL AGVAFVANTL LSGSVTLRLT
edg8_human AVCAFIVLEN LAVLLVLGRH PRFHAPMFLL LGSLTLSDLL AGAAYAANIL LSGPLTLKLS
edg6_human AASCLVVLEN LLVLAAITSH MRSRRWVYYC LVNITLSDLL TGAAYLANVL LSGARTFR1A



121 180
edg2_human VSTWLLRQGL IDTSLTASVA NLLAIAIERH ITVFR.MQLH TRMSNRRVW VIWIWTMAI
edg7_human VNRWFLRQGL LDSSLTASLT flLLVIAVERH MSIMR.MRVH SNLTKKRVTL LILLVWAIAI
edg4_human LEGWFLRQGL LDTSLTASVA TLLAIAVERH RSVMA.VQLH SRLPRGRWM LIVGVWVAAL
edg1_human PAQWFLREGS MFVALSASVF SLLAIAIERY ITMLK.MKLH NGSNNFRLFL LISACWVISL
edg3_human PTVWFLREGS MFVALGASTC SLLAIAIERH LTMIK.MRPY DANKRHRVFL LIGMCWLIAF
edg5_human PVQWFAREGS ASITLSASVF SLLAIAIERH VAIAK.VKLY GSDKSCRMLL LIGASWLISL
edg8_human PALWFAREGG VFVALTASVL SLLAIALERS LTMAR.RGPA PVSSRGRTLA MAAAAWGVSL
edg6_human PAQWFLREGL LFTALAASTF SLLFTAGERF ATMVRPVAES GATKTSRVYG FIGLCWLLAA



181 240
edg2_human VMGAIPSVGW NCICDIENCS NMAPLYSDSY LVFWAIFNLV TFVVMWLYA HIFGYVRQRT
edg7_human FMGAVPTLGW NCLCNISACS SLAPIYSRSY LVFWTVSNLM AFLIMWVYL RIYVYVKRKT
edg4_human GLGLLPAHSW HCLCALDRCS RMAPLLSRSY LAVWALSSLL VFLLMVAVYT RIFFYVRRRV
edg1_human ILGGLPIMGW NCISALSSCS TVLPLYHKHY ILFCTTVFTL LLLSIVILYC RIYSLVRTRS
edg3_human TLGALPILGW NCLHNLPDCS TILPLYSKKY IAFCISIFTA ILVTIVILYA RIYFLVKSSS
edg5_human VLGGLPILGW NCLGHLEACS TVLPLYAKHY VLCVVTIFSI ILLAIVALYV RIYCWRSSH
edg8_human LLGLLPALGW NCLGRLDACS TVLPLYAKAY VLFCVLAFVG ILAAICALYA RIYCQVRANA
edg6_human LLGMLPLLGW NCLCAFDRCS SLLPLYSKRY ILFCLVIFAG VLATIMGLYG AIFRLVQASG



241 300
edg2_human MRMSRHSSGP R RNR DTMMSLLKTV VIVLGAFIIC WTPGLVLLLL D.VCCP..QC
edg7_human NVLSPHTSGS I SRR RTPMKLMKTV MTVLGAFWC WTPGLWLLL DGLNCR..QC
edg4_human QRMAEHVSCH P RYR ETTLSLVKTV VIILGAFVVC WTPGQVVLLL DGLGCE..SC
edg1_human RRLTFR KNISKASRS SENVALLKTV IIVLSVFIAC WAPLFILLLL DV.GCKVKTC
edg3_human RKVANH m S ERSMALLRTV VIWSVFIAC WSPLFILFLI DV.ACRVQAC
edg5_human ADMA. A PQTLALLKTV TIVLGVFIVC WLPAFSILLL DY.ACPVHSC
edg8_human RRLPARPGTA GTTSTRARRK PRSLALLRTL SVVLLAFVAC WGPLFLLLLL DV.ACPARTC
edg6_human QKAP RPAARRK ARR..LLKTV LMILLAFLVC WGPLFGLLLA DVFGSNLWAQ



301 360
edg2_human DVLAYEKFFL LLAEFNSAMN PIIYSYRDKE MSATFRQILC CQRSENPTGP TESSDRSASS
edg7_human GVQHVKRWFL LLALLNSWN ?IIYSYKDED MYGTMKKMIC CFSQENP ERRPSR
edg4_human NVLAVEKYFL LLAEANSLVN AAVYSCRDAE MRRTFRRLLC CACLRQSTRE SVHYTSSAQG
edg1_human DILFRAEYFL VLAVLNSGTN PIIYTLTKKE MRRAFIRIMS CCKCPSGO S
edg3_human PILFKAQWFI VLAVLNSAMN PVIYTLASKE MRRAFFRLV. .CNC.LVR G
edg5_human PILYKAHYFF AVSTLNSLLN PVIYTWRSRD LRREVLRPLQ CWRPGVGV Q
edg8_human PVLLQADPFL GLAMANSLLN PIIYTLTNRD LRHALLRLVC CGRHSCGRDP SGS..QQSAS
edg6_human EYLRGMDWIL ALAVLNSAVN PIIYSFRSRE VCRAVLSFLC CGCLRLGMRG PGDCLARAVE

361 416
edg2_human
edg7_human
edg4_human
edg1_human
edg3_human
edg5_human
edg8_human
edg6_human
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